Query-Based Summarization using Rhetorical Structure Theory

نویسنده

  • Wauter Bosma
چکیده

Research on Question Answering is focused mainly on classifying the question type and finding the answer. Presenting the answer in a way that suits the user’s needs has received little attention. This paper shows how existing question answering systems—which aim at finding precise answers to questions—can be improved by exploiting summarization techniques to extract more than just the answer from the document in which the answer resides. This is done using a graph search algorithm which searches for relevant sentences in the discourse structure, which is represented as a graph. The Rhetorical Structure Theory (RST) is used to create a graph representation of a text document. The output is an extensive answer, which not only answers the question, but also gives the user an opportunity to assess the accuracy of the answer (is this what I am looking for?), and to find additional information that is related to the question, and which may satisfy an information need. This has been implemented in a working multimodal question answering system where it operates with two independently developed question answering modules.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a Suitable Rhetorical Representation for Arabic Text Summarization

Text summarization based on rhetorical structure theory has shown extremely interesting result. The process of extracting the text summary from the result of the rhetorical parser is not a singleton. Different rhetorical structure trees are generated from one text. Unfortunately, the result of the generated summary is not equivalent for those trees, and the correctness of the result is affected...

متن کامل

Exploiting Rhetorical Relations in Blog Summarization

Exploiting Rhetorical Relations in Blog Summarization Shamima Mithun, Ph.D. Concordia University, 2012 With the rapid growth of the Social Web, a large amount of informal opinionated texts are available on numerous topics. Natural language tools for automatically analyzing these opinions become necessary to help individuals, organizations, and governments in making timely decisions. A query-bas...

متن کامل

Integrating Rhetorical-Semantic Relation Models for Query-Focused Summarization

We present our recent work on query-focused summarization, focusing on our efforts in building and applying models of rhetorical-semantic relations (RSRs) such as contrast and causality. We overview ongoing work in extracting and evaluating RSR models. We describe our system for query-focused summarization, focusing on an enhanced, feature-based framework. We present results of experiments to m...

متن کامل

Joint semantic discourse models for automatic multi-document summarization

Automatic multi-document summarization aims at selecting the essential content of related documents and presenting it in a summary. In this paper, we propose some methods for automatic summarization based on Rhetorical Structure Theory and Cross-document Structure Theory. They are chosen in order to properly address the relevance of information, multidocument phenomena and subtopical distributi...

متن کامل

A Hybrid Approach to Utilize Rhetorical Relations for Blog Summarization

The availability of huge amounts of online opinions has created a new need to develop effective query-based opinion summarizers to analyze this information in order to facilitate decision making at every level. To develop an effective opinion summarization approach, we have targeted to resolve specifically Question Irrelevancy and Discourse Incoherency problems which have been found to be the m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004